Exploring Neighborhoods in the Metagenome Universe

Authors

  • Kathrin P. Aßhauer
  • Heiner Klingenberg
  • Thomas Lingner
  • Peter Meinicke
Abstract

The variety of metagenomes in current databases provides a rapidly growing source of information for comparative studies. However, the quantity and quality of supplementary metadata still lag behind. It is therefore important to be able to identify related metagenomes by means of the available sequence data alone. We have studied efficient sequence-based methods for large-scale identification of similar metagenomes within a database retrieval context. In a broad comparison of different profiling methods, we found that vector-based distance measures are well suited to the detection of metagenomic neighbors. Our evaluation on more than 1700 publicly available metagenomes indicates that, for a query metagenome from a particular habitat, on average nine out of ten nearest neighbors represent the same habitat category, independent of the profiling method or distance measure used. While a neighborhood accuracy of 100% can be achieved for well-defined labels, neighbor detection is in general severely affected by a natural overlap of manually annotated categories. In addition, we present results of a novel visualization method that is able to reflect the similarity of metagenomes in a 2D scatter plot. The visualization method shows a similarly high accuracy in the reduced space as in the high-dimensional profile space. Our study suggests that, for inspection of metagenome neighborhoods, the profiling methods and distance measures can be chosen to provide a convenient interpretation of results in terms of the underlying features. Furthermore, supplementary metadata of metagenome samples will in future need to comply with readily available ontologies for fine-grained and standardized annotation. To make profile-based k-nearest-neighbor search and the 2D visualization of the metagenome universe available to the research community, we included the proposed methods in our CoMet-Universe server for comparative metagenome analysis.
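The neighborhood accuracy mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each metagenome is represented by a numeric profile vector, that Euclidean distance stands in for the distance measures compared in the study, and that accuracy is the mean fraction of a query's k nearest neighbors sharing its habitat label. The function names, the toy profiles, and the labels are hypothetical and are not taken from the paper or from the CoMet-Universe server.

```python
import math


def k_nearest(profiles, query_idx, k, distance=math.dist):
    """Return indices of the k profiles closest to profiles[query_idx]."""
    query = profiles[query_idx]
    # Exclude the query itself, then rank the remaining profiles by distance.
    others = [i for i in range(len(profiles)) if i != query_idx]
    others.sort(key=lambda i: distance(query, profiles[i]))
    return others[:k]


def neighborhood_accuracy(profiles, labels, k, distance=math.dist):
    """Mean fraction of each query's k nearest neighbors sharing its label."""
    total = 0.0
    for q in range(len(profiles)):
        neighbors = k_nearest(profiles, q, k, distance)
        total += sum(labels[n] == labels[q] for n in neighbors) / k
    return total / len(profiles)


# Hypothetical toy profiles (relative feature abundances), not real data.
profiles = [
    [0.70, 0.20, 0.10],
    [0.65, 0.25, 0.10],
    [0.72, 0.18, 0.10],
    [0.10, 0.30, 0.60],
    [0.12, 0.28, 0.60],
    [0.08, 0.32, 0.60],
]
labels = ["marine", "marine", "marine", "soil", "soil", "soil"]

accuracy = neighborhood_accuracy(profiles, labels, k=2)
print(f"neighborhood accuracy (k=2): {accuracy:.2f}")
```

Because `distance` is a parameter, another vector-based measure can be swapped in without changing the search itself, which mirrors the abstract's point that the measure can be chosen for interpretability.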




Journal title:

Volume 15, Issue

Pages -

Publication date: 2014